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Introduction 

Natural  products  produced  by  bacteria  and  fungi  encompass  a  broad  range  of  bioactivity  and  are 
an  important  source  of  therapeutics  in  use  today.  Cancer  therapy  is  no  exception  to  this.  One  prominent 
example  is  doxorubicin,  a  compound  naturally  produced  by  a  soil-dwelling  bacterium,  and  is  a  common 
component  in  chemotherapy  regimens  for  many  types  of  cancers  including  breast  cancer.  Due  to  the 
chemical  complexity  of  these  molecules  which  often  prohibits  de  novo  chemical  synthesis,  biosynthesis 
is  an  attractive  approach  to  produce  and  generate  analogs  of  these  bioactive  compounds.  SF2575, 
studied  here,  is  a  tetracycline  family  natural  product  which  was  found  to  have  exceptionally  potent 
anticancer  activity  both  through  an  in  vitro  cytotoxicity  assay  with  P388  leukemia  cells,  and  during  an  in 
vivo  assay  using  mouse  xenografts1.  More  recently,  a  60-cell  line  screening  by  the  National  Cancer 
Institute  demonstrated  the  potent  activity  of  SF2575  against  nearly  all  types  of  cancer  cell  lines  tested, 
resulting  in  an  average  IC50  value  of  1 1.2  nM.  The  mechanism  of  action  of  several  closely  related 
SF2575  analogs,  TAN-15 18A  and  TAN-15 18B  has  been  identified  as  inhibition  of  DNA  topoisomerase 
I2.  As  these  structures  are  nearly  identical  to  that  of  SF2575,  it  is  likely  that  this  family  of  compounds 
shares  a  common  molecular  target.  As  key  enzymes  during  DNA  replication,  both  DNA  topoisomerase 
I  and  II  are  known  targets  for  current  anticancer  therapies  in  clinical  use  such  as  doxorubicin,  a 
topoisomerase  II  poison3,  and  camptothecin  derivatives  which  target  topoisomerase  I4. 

During  this  study,  we  have  identified  and  sequenced  the  ssf  gene  cluster  responsible  for  the 
biosynthesis  of  SF2575  from  Streptomyces  sp.  SF2575.  This  ssf  gene  cluster  is  only  the  third 
tetracycline  family  gene  cluster  to  be  identified  and  sequenced.  The  genetic  information  offers  valuable 
opportunities  to  further  enhance  our  understanding  of  tetracycline  biosynthesis,  and  to  investigate  an 
entirely  new  set  of  tetracycline  tailoring  modifications  that  distinguish  SF2575  from  the  previously 
studied  oxytetracycline  and  chlorotetracycline  (Figure  1).  SF2575  has  a  much  more  complex  and 
heavily  decorated  structure  than  previously  studied  tetracyclines  which  is  reflected  in  the  size  of  the 
sequenced  gene  cluster  which  was  nearly  double  that  of  oxytetracycline.  Bioinformatic  analysis  of  the 
gene  cluster  along  with  identification  of  the  possible  biosynthetic  intermediates  from  S.  sp.  SF2575 
fermentation  extract  led  to  the  assembly  of  a  putative  biosynthetic  pathway  for  synthesis  of  SF2575.  We 
have  reconstituted  the  early  portions  of  the  ssf  pathway  in  a  heterologous  host  and  demonstrated  the 
biosynthesis  of  the  tetracycline  core  parallels  the  previously  published  biosynthetic  pathway  of 
oxytetracycline5'7.  Additionally,  we  have  identified  key  tailoring  enzymes  involved  in  the  attachment  of 
salicylate  to  the  tetracycline  aglycon  and  verified  their  function  through  in  vitro  analysis.  Elucidation  of 
the  SF2575  biosynthetic  pathway  and  characterization  of  these  key  tailoring  enzymes  is  an  important 
first  step  toward  structure  activity  relationship  studies  and  engineered  biosynthesis  of  new  anticancer 
compounds. 


1  SF2575  R-i  =  Me  R2  =  Me 

2  TAN-1518  A  R1  =  H  R2  =  Me 

3  TAN-1518  B  R,  =  Me  R2  =  Et 


Oxytetracycline 

4 


Chlorotetracycline 

5 


Figure  1:  Natural  product  tetracyclines:  SF2575  1  produced  by  S.  sp.  SF2575;  TAN-1518A  2  and  TAN-1518B  3  produced  by 
Streptomyces  sp.  AL-16012;  oxytetracycline  4  produced  by  S.  rimosus ;  and  chlorotetracycline  5  produced  by  S.  aureofaciens. 
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Task  1.  Elucidate  the  biosynthetic  pathway  of  SF2575  from  the  native  producer. 

Identification  and  sequencing  of  gene  cluster 


Aromatic  polyketide  biosynthetic  genes  are  typically  clustered  together  on  the  host  genomes, 
which  makes  them  ideal  for  identification  by  PCR  screening  of  cosmid  libraries8.  Degenerate  primers 
based  on  the  conserved  KSa  sequence9  were  used  to  probe  a  cosmid  library  containing  S.  sp.  SF2575 
genomic  DNA.  Several  PCR  products  from  different  cosmids  were  sequenced  and  were  found  to 
encode  an  identical  KS  fragment.  This  indicates  that  there  is  likely  only  one  aromatic  PKS  gene  cluster 
present  in  the  S.  sp.  SF2575  genome. 

Cosmid  5F-15  was  selected  and  sequenced  using  a  combination  of  shotgun  sequencing  and 
primer  walking.  Sequences  near  the  ends  of  this  genomic  fragment  were  used  to  screen  for  additional 
cosmids  containing  overlapping  sequence  information.  A  double  crossover  mutant  which  inactivated 
ssfB  was  created  to  verify  the  authenticity  of  the  gene  cluster.  The  resulting  mutant  produced  no  SF2575 
or  any  recognizable  SF2575  precursors,  which  indicates  that  the  ssf  gene  cluster  is  indeed  responsible 
for  SF2575  biosynthesis.  The  ^/biosynthetic  cluster  spans  a  47.2  kb  region  and  contains  40  ORFs 
(Figure  2).  Functions  of  the  encoded  proteins  were  assigned  by  sequence  similarity  using  NCBI  protein- 
protein  BLAST  (Table  Al).  A  putative  biosynthetic  pathway  was  then  constructed  using  the  proposed 
protein  functions.  Heterologous  biosynthesis  and  in  vitro  assays  were  used  to  confirm  sections  of  the 
proposed  pathway  including  biosynthesis  of  the  tetracycline  core  and  attachment  of  the  salicylic  acid 
ligand.  The  KS  II  gene  was  knocked  out  by  a  double  crossover  mutation  and  the  absence  of  SF2575 
production  from  the  resulting  culture  confirmed  that  this  gene  cluster  is  indeed  responsible  for  SF2575 
biosynthesis. 


SF-15 


orf-3orf-1  M4  Y4  Y3  Y1  SS  X3 

>: X  '*.>  ip 


SI  T1  B  D  orfl  orf3 


orf-2  02  V  M3  Y2  U  L2  LI  Q  01  R  N  J  H  G  S4  F  XI  S2  A  C  orf2 


+ 


OKb  IQkb  20k  b  30kb  40kb  5Qkb 

|~|  Extended  minimal  PKS  [W]  Aglycon  Tailoring  SM  Salicylic  acid  biosynthesis  [§)  Regulation  and  resistance 

I  I  Cyclases  H  Sugar  biosynthesis  OMj  Angelic  acid  biosynthesis  ■  Unknown  role  □  Boundary  proteins 

Figure  2:  Organization  of  the  ^/biosynthetic  gene  cluster.  Genes  are  categorized  according  to  their  proposed  role.  The  gene  cluster  spans 
47.2  kb  and  contains  40  ORFs.  Details  of  proposed  functions  are  shown  in  Table  1. 


Identification  of putative  intermediates  from  fermentation  broth 

In  order  to  decipher  the  timing  of  the  three  tailoring  reactions  in  the  biosynthesis  of  SF2575: 
glycosylation  with  D-olivose,  acylation  of  salicylic  acid  and  acylation  of  angelic  acid,  crude  extract  of  S. 
sp.  SF2575  culture  was  prepared  and  analyzed  on  HPLC  and  LCMS  to  try  to  identify  any  potential 
stable  intermediates  that  may  be  present  (Figure  3).  Using  selected  ion  monitoring,  the  only  potential 
intermediates  identified  were  one  with  molecular  weight  of  575  ([MTH]+  at  m/z  576,  RT  =  16.8  min) 
and  a  second  with  molecular  weight  of  695  ([M+H]+  at  m/z  696,  RT  =  23.2  min),  in  addition  to  the 
parent  compound  SF2575  ([M+H]+  at  m/z  778,  RT  =  30.1  min)  (Figure  3).  The  UV  spectrum  of  each 
putative  precursor  compound  was  similar  to  that  of  SF2575  with  a  characteristic  tetracycline  Amax  at  358 
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nm,  and  a  A,max  at  302  nm  for  SF2575  and  34  characteristic  of  the  salicylate.  The  UV  spectrum  of  33 
lacks  this  contribution  at  302  nm  resulting  in  a  smooth  peak  at  358  nm,  and  is  indicative  of  the  loss  of 
salicylate  compared  to  34  as  suggested  by  the  molecular  weight  difference.  To  confirm  the  identity  of 
these  compounds  as  shown  in  Figure  6,  authentic  standards  were  prepared  from  base  hydrolysis  of 
purified  SF2575  as  described  by  Hatsu  et.  al.10  Following  purification  of  33  and  34  from  the  base 
hydrolysis  reactions,  proton  NMR  and  HRMS  were  used  to  confirm  the  structures  by  comparison  to 
published10.  These  prepared  samples  were  then  used  as  standards  to  verify  the  identities  of  33  and  34  in 
the  fermentation  extract  of  S.  sp.  SF2575  by  HPLC  retention  time,  mass  fragmentation  pattern  and  UV 
spectra. 

Other  potential  intermediates,  such  as  33  acylated  with  angelic  acid,  or  25  acylated  with  salicylic 
acid  were  not  detected  in  the  crude  extracts.  These  evidences,  along  with  the  positive  detection  of  33 
and  34  are  consistent  with  the  proposed 
biosynthetic  pathway  of  SF2575  as  shown  in 
Figure  6.  Therefore,  the  likely  order  of  the 
tailoring  modifications  starts  with  C- 
glycosylation  of  25  with  30  that  results  in 
33,  followed  by  acylation  of  31  at  C-4  to 
produce  34,  and  capped  by  the  acylation  of 
0-4’  of  34  with  35  to  produce  the  final 
product  SF2575.  It  is  noteworthy  that  each 
of  these  isolated  intermediates  has  already 
been  O-methylated  at  the  C-12a  and  C-6 
hydroxyl  groups,  which  indicates  that  these 
modifications  likely  take  place  early  in  the 
biosynthesis,  further  confirming  25  as  the 
proposed  aglycon.  While  the  tetracyclic 
aglycon  was  not  identified  in  the 
fermentation  extract,  this  could  be  due  either 
to  the  potential  instability  of  this  compound 
or  the  rate  of  the  glycosylation  reaction 
which  may  not  permit  its  accumulation  in 
detectable  quantities. 

Biosynthesis  of  the  amidated  polyketide  backbone 

Using  the  information  gathered  from  the  fermentation  analysis,  we  constructed  a  putative 
pathway  for  SF2575  using  the  functionality  found  in  the  SF2575  gene  cluster  shown  in  figures  4  and  6. 
The  first  steps  are  the  biosynthesis  of  the  amidated  polyketide  backbone.  Biosynthesis  of  SF2575  is 
predicted  to  be  parallel  to  that  of  oxytetracycline  and  chlorotetracycline  through  the  formation  of  the 
tetracycline  core.  Examination  of  the  carbon  backbone  of  these  molecules  indicates  that  the  biosynthetic 
pathway  is  likely  identical  through  the  formation  of  4-keto-anhydrotetracycline  (4-keto-ATC).  Genes 
putatively  involved  in  the  biosynthesis  of  the  SF2575  aglycon  share  high  sequence  homology  to  many  of 
the  genes  found  in  the  recently  elucidated  oxytetracycline  (oxy)  biosynthetic  pathway5,  n.  As  in  the 
oxytetracycline  pathway,  amidated  polyketide  backbone  is  produced  by  the  minimal  polyketide  synthase 
(PKS).  The  ssf  minimal  PKS  consists  of  SsfA  (KS),  SsfB  (CLF),  and  SsfC  (ACP)  and  are  highly 
homologous  to  the  oxy  minimal  PKS5.  The  amidotransferase  SsfD,  which  is  responsible  for  producing 


MW  =  575  MW  =  695  MW  =  777 


5  10  15  20  25  30  35  40 

time  (min) 


Figure  3  HPLC  analysis  of  the  S.  sp.  SF2575  extract  after  7 
days  growth  on  solid  Bennette’s  media.  The  main  product  1, 
along  with  putative  intermediates  33  and  34  are  indicated.  The 
UV  spectra  of  these  compounds  are  shown  for  comparison. 
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the  malonamate  starter  unit  unique  to  tetracyclines,  is  found  adjacent  to  the  ssf  minimal  PKS.  The  C-2 
amidated  starter  unit  is  one  of  the  signature  moieties  of  tetracycline,  and  the  revelation  of  SsfD  as  an 
OxyD  homolog  with  68%  sequence  identity  was  one  of  the  earliest  convincing  pieces  of  evidence  that 
this  gene  cluster  indeed  encodes  a  tetracycline  compound.  Zhang  and  coworkers  demonstrated  the 
presence  of  OxyD  was  essential  for  incorporation  of  the  C-2  amide5.  These  four  enzymes,  SsfABCD, 
complete  the  ‘extended  minimal  PKS’  sufficient  for  producing  the  amidated  polyketide  chain.  In 
addition,  SsfU,  which  is  76%  identical  to  OxyJ,  is  predicted  to  be  the  C-8  ketoreductase  that 
regioselectively  reduces  the  nascent  backbone  as  shown  in  figure  47. 

These  function  of  these  genes  was  verified  experimentally  through  heterologous  reconstitution. 
SsfABCDU  were  cloned  into  a  pRM5  derived  shuttle  vector  to  produce  pLP27  and  expressed  in  S. 
lividans  K4-1 14.  The  product  profile  of  K4-1 14/pLP27  was  indistinguishable  from  that  of  K4- 
1 14/pWJ35,  which  contains  the  corresponding  enzymes  from  the  oxy  biosynthetic  pathway5.  By 
comparison  to  an  authentic  standard,  the  major  product  of  the  extract  (RT  =13.5  min)  was  confirmed  to 
be  the  isoquinolone  WJ35  (20  mg/L),  which  can  form  via  the  spontaneous  cyclization  of  the  reduced 
amidated  polyketide  9.  We  further  investigated  whether  the  oxy  and  ssf  extended  minimal  PKS 
components  are  functionally  interchangeable.  Three  additional  shuttle  vectors  were  prepared  with  a 
combination  of  oxy  and  ssf  genes  and  transformed  into  K4-1 14.  Each  of  these  host/vectors  combinations 
were  capable  of  producing  WJ35  in  similar  quantities  as  K4-1 14/pLP27,  indicating  that  the  ssf 
components  are  functionally  compatible  and  equivalent  to  their  oxy  counterparts.  Importantly,  these 
results  confirm  i)  the  extended  ssf  minimal  PKS  (SsfABCD)  is  capable  of  synthesizing  the  full  length 
amidated  polyketide  precursor;  ii)  the  hypothesis  that  the  carbon  backbone  of  SF2575  is  biosynthesized 
via  a  tetracycline-like  pathway;  and  iii)  SsfU  as  the  ketoreductase. 


Figure  4.  Biosynthesis  of  the  SF2575  aglycon  25 
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6-methylpretetramid  (6-MPT)  is  a  characteristic  stable  intermediate  identified  in  tetracycline 
producing  strains.  Again,  here  we  expected  the  formation  of  6-MPT  during  SF2575  biosynthesis  to 
parallel  the  oxytetracycline  pathway.  During  oxytetracycline  biosynthesis,  the  steps  following  the 
extended  minimal  PKS  and  reduction  by  C-8  KR  are  regioselective  cyclization  by  OxyK  and  OxyN 
followed  by  C-6  methylation  to  achieve  6-MPT  biosynthesis.  Unlike  the  oxy  cluster,  however,  the  ssf 
gene  cluster  contains  four  genes  that  have  homology  to  cyclases.  SsfY  1  is  highly  similar  to  OxyK  (67% 
sequence  similarity)  and  is  predicted  to  be 
responsible  for  D  ring  cyclization  to  afford  12. 

Indeed,  cotransformation  of  a  pSET152  derived 
integrating  vector  pLP77  containing  SsfYl  with 
pLP27  into  K4-1 14  resulted  in  the  biosynthesis 
of  12,  confirming  the  role  of  SsfY  1 . 

Intriguingly,  an  additional  cyclase,  SsfY3,  also 
displays  sequence  similarity  to  first  ring 
cyclase/aromatases,  with  39%  similarity  to 
OxyK.  However  cotransformation  of  a  SsfY3 
containing  vector  with  pLP27  did  not  result  in 
first  ring  aromatization,  indicating  that  SsfY3 
cannot  functionally  replace  SsfYl  as  first  ring 
aromatase/cyclase.  Of  the  remaining  two,  SsfY2 
is  homologous  to  second  ring  cyclases  from 
several  aromatic  polyketide  pathways.  Although 
SsfY2  has  low  sequence  homology  (25% 
similarity  and  14.5%  identity)  to  the  second  ring 
cyclase  from  the  oxy  pathway  (OxyN),  it 
displays  stronger  homology  to  second  ring 
cyclases  from  benzoisochromanequinone  (BIQ) 
pathways  such  as  granaticin  (68%  similarity, 

56%  identity)12  and  medermycin  (66% 
similarity,  55%  identity)13.  SsfY2  was  therefore 
predicted  to  be  the  second  ring  cyclase  in  the  ssf 
pathway.  The  last  cyclase  SsfY4  is  similar  to 
Oxyl,  itself  similar  to  fourth  ring  cyclase  MtmX 
from  the  mithramycin  pathway,  however  Oxyl 
was  found  to  be  uninvolved  in  the  biosynthesis 
of  oxytetracycline7. 

In  an  attempt  to  fully  reconstitute  the 
biosynthesis  of  6-methylpretetramid 
intermediate,  SsfY2  and  C-6  methyltransferase 
SsfM4  were  added  to  pLP77  and  cotransformed 
with  pLP27  into  K4-1 14.  The  resulting  strain 
however  produced  only  three  ringed  compounds 
previously  identified  as  WJ83Q2  and  WJ83Q1.  To  determine  if  the  ssf  pathway  requires  the  additional 
cyclases  found  in  its  gene  cluster  towards  the  formation  of  16,  pLPl  15  and  pLPl  18  were  constructed, 
which  contained  in  addition  to  genes  encoded  in  pLPl  13,  ssfY4,  and  ssfY3Y4,  respectively.  K4-1 14 
contransformed  with  pLP27  and  these  constructs  continued  to  produce  14  only,  suggesting  that  for  the 
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Figure  5:  Heterologous  reconstitution  of  early  biosynthetic 
pathway  S.  lividans  K4-1 14.  (A)  HPLC  analysis  (245  nm) 
of  the  K4-114/pLP27  extract  shows  the  amidated,  reduced 
polyketide  11  is  the  major  product,  confirming  the 
biosynthesis  of  the  polyketide  backbone  7  by  SsfABCD 
and  the  subsequent  C-9*  reduction  by  SsfU.  (B)  HPLC 
analysis  (253  nm)  of  the  K4-114/pLP27/pLP77  extract 
shows  addition  of  the  putative  cyclase  SsfYl  leads  to 
complete  cyclization  and  aromatization  of  the  D  ring  and 
formation  of  the  shunt  benzopyrone  13.  (C)  HPLC  analysis 
(430  nm)  of  K4-1 14/pLP27/pLP126  extract  shows  19,  the 
oxidized  form  of  18,  as  the  dominant  product. 
Biosynthesis  of  18  using  entirely  ssf  genes 
(SsfABCDUYlY2M4L2)  indicates  the  tetracycline  nature 
of  the  ^/biosynthetic  pathway.  (D)  HPLC  analysis  (395 
nm)  of  K4-114/pLP36  extract  confirms  the  function  of 
Ssf02  as  the  oxygenase  that  dihydroxylated  C-4  and  C-12a 
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ssf  pathway,  different  enzyme(s)  may  be  needed  to  facilitate  cyclization  of  the  fourth  ring.  We  tested 
this  possibility  by  inserting  additional  ssf  genes  juxtaposed  to  the  ssf  cyclase  genes  in  the  cluster  into 
K4-1 14  and  examined  the  metabolic  profiles.  Fortuitously,  upon  insertion  of  ssfL2  into  pLPl  13  to 
afford  pLP126,  and  cotransformation  of  pLP27  and  pLP126  into  K4-1 14,  the  formation  of  19  was 
detected  as  shown  in  Figure  5D.  SsfL2  has  homology  to  long-chain  fatty  acyl-CoA  ligases  as  well  as 
putative  acyl-CoA  ligases  from  several  aromatic  polyketide  pathways  such  as  CmmLII  from  the 
chromomycin  gene  cluster14,  PokL  from  the  polyketomycin  gene  cluster15,  MtmL  from  the  mithramycin 
gene  cluster16,  as  well  as  OxyH5.  The  role  of  these  enzymes  in  their  respective  biosynthetic  pathways 
has  not  been  determined  and  has  been  suggested  to  function  as  an  accessory  protein14.  Interestingly, 
biosynthesis  of  these  compounds  all  require  the  equivalent  Claisen  cyclization  of  the  A  rings.  The 
mechanism  of  SsfL2  in  the  heterologous  biosynthesis  of  18  with  .s.s/  PKS  components  is  yet  to  be 
determined. 

Tailoring  steps  leading  to  the  putative  SF25 75  aglycon 

During  oxytetracycline  biosynthesis,  the  next  step  following  the  formation  of  6-MPT  is  the  dual 
hydroxylation  of  the  A  ring  by  OxyL  at  C-12a  and  C-4  to  produce  the  cyclohexenone  containing  4-keto- 
anhydrotetracycline  (4-keto-ATC)  20,  which  is  unstable  and  degrades  into  observed  product  WJ135  216. 
Ssf02,  which  is  found  at  the  boundary  of  the  ssf  pathway,  is  an  OxyL  homolog  and  is  most  likely  to 
perform  this  role  in  the  ssf  pathway.  4-keto-ATC  is  likely  the  last  common  intermediate  between  the 
oxy  and  the  ^/biosynthetic  pathways.  In  the  oxy  pathway,  reductive  amination  of  the  C-4  ketone  group 
by  OxyQ  to  yield  4-amino-ATC  26  and  subsequent  dimethylation  by  OxyT  afford  the  stable 
intermediate  anyhydrotetracycline  276.  The  C-4  position  in  1  lacks  the  dimethylamino  group  and  is 
instead  acylated  with  a  salicylic  acid  group.  This  requires  ketoreduction  of  20  at  C-4  to  produce  4- 
hydroxyl-ATC  22  for  the  downstream  salicylation  reaction.  To  determine  whether  oxygenase  OxyL 
installs  the  hydroxyl  group  directly  or  if  an  additional  ketoreductase  is  necessary,  oxyL  gene  was 
replaced  with  ssf02  in  the  shuttle  plasmid  pWJ135,  which  contains  all  the  oxy  genes  required  to 
synthesize  20  (Table  2)6.  K4-1 14  transformed  with  the  resulting  plasmid  pLP36  produced  the  tricyclic 
ketone  21  (Figure  5D),  which  is  a  spontaneously  degraded  product  of  20  as  observed  previously6.  Then, 
the  genes  oxyQ  and  oxyT  were  introduced  to  pLP36  to  afford  pLP75  (Table  2).  As  expected,  K4- 
1 14/pLP75  produced  ~  10  mg/L  of  ATC  as  the  dominant  product,  demonstrating  that  OxyL  and  SsfL2 
catalyze  the  identical  transformation  of  18  to  20.  These  results  also  support  the  proposal  that  a  dedicated 
C-4  ketoreductase  is  required  to  reduce  20  to  22  and  reveal  the  reactive  hydroxyl  nucleophile  for  the 
subsequent  salicylation  modification.  SsfF  is  a  ketoreductase  with  sequence  homology  to  NADPH- 
dependent  oxidoreductases  with  specificity  for  aromatic  and  nonpolar  substrates  (PF00248),  and  is 
therefore  assigned  as  a  candidate  to  catalyze  this  step. 

The  last  tailoring  enzyme  in  the  ssf  pathway  that  resemblances  to  enzymes  found  in  oxy  pathway 
is  SsfOl  which  has  54%  similarity  to  OxyS.  SsfOl  therefore  likely  plays  the  parallel  role  to  oxidize  the 
aromatic  C-ring  in  22  to  23.  Two  O-methyltransferases  are  then  needed  to  methylate  the  C-6  and  the  C- 
12a  hydroxyl  groups.  There  are  three  methyltransferases  (other  than  the  C-6  methyltransferase  SsfM4) 
present  in  the  ssf  gene  cluster.  SsfMl  and  SsfM2  bear  strong  resemblance  to  O-methyltransferases, 
while  SsfM3  is  similar  to  both  O-methyltransferases  and  C-methyltransferases.  It  is  likely  that  two  of 
these  are  responsible  for  the  O-methylation  of  C-12a  and  C-6  hydroxyls.  The  gene  encoding  SsfM2  is 
transcribed  from  the  same  operon  that  encodes  ssfOl  and  is  therefore  the  likely  C-6  O- 
methyltransferase.  In  the  same  operon  is  the  gene  ssfQ,  which  encodes  an  S-adenosylmethionine  (SAM) 
synthases  and  likely  serves  as  an  auxiliary  role  to  produce  sufficient  SAM  for  the  methyltransfer 
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reactions.  It  is  known  that  6-deoxy  analogs  of  tetracyclines,  such  as  doxycycline,  have  enhanced 
stability  toward  acid  and  base  degradation  .  The  C-6-OH  methylation  may  have  a  similar  stabilizing 
effect  on  SF2575.  Once  identified,  the  C-6  O-methyltransferase  may  potentially  be  useful  to  generate 
C-6-methoxytetracycline  analogs  that  have  enhanced  stability. 

The  last  step  in  the  biosynthesis  of  oxytetracycline  is  the  reduction  of  5a,  1  la- 
dehydrooxytetracycline  to  oxytetracycline.  This  reductive  modification  required  for  the  biosynthesis  of 
chlorotetracycline  and  oxytetracycline  has  been  postulated  to  involve  a  gene  located  outside  their 
respective  gene  clusters18.  The  tchA  gene  encodes  a  flavin  dependent  oxidoreductase  and  has  been 
identified  in  S.  aureofaciens  to  be  involved  for  the  biosynthesis  of  519.  An  identical  reduction  step  is 
also  needed  to  complete  the  biosynthesis  of  aglycon  25,  from  for  example,  24.  One  candidate  for  this 
reaction  in  the  ssf  gene  cluster  is  a  putative  dehydrogenase  SsfP.  Experimental  evidence  will  be  needed 
to  determine  if  this  reduction  step  is  catalyzed  by  an  enzyme  encoded  in  the  ssf  gene  cluster  such  as  SsfP 
or  if  an  external  gene  is  required  as  in  chlorotetracycline  biosynthesis. 

Genes  involved  in  the  biosynthesis  of  D-olivose 

Deoxysugar  decoration  is  an  important  tailoring  modification  of  bacterial  secondary  metabolites 
and  is  usually  essential  for  the  bioactivity  of  the  natural  products20.  The  C-9  of  1  is  modified  with  a  C- 
glycoside  D-olivose,  which  is  also  found  among  other  polyketides  such  as  mithramycin21,  landomycin 
A22,  urdamycin  A23,  and  simocyclinone24  etc.  Putative  enzymes  encoded  in  the  ssf  gene  cluster  that  can 
convert  glucose  to  NDP-D-olivose  are  SsfSl  (NDP-glucose  synthase),  SsfS2  (4’,6’-dehydratase),  SsfS3 
(2’, 3’ -dehydratase),  SsfS4  (C-3’  reductase),  and  SsfS5  (C-4’  reductase).  NDP-D-olivose  is  subsequently 
transferred  to  the  tetracycline  aglycon  25  by  a  C-glycosyltransferase  to  yield  33  (Figure  6).  C- 
glycosyltransferases  are  more  rare  than  O-glycosyltransferases,  and  are  found  in  a  number  of  aromatic 
PKS  biosynthetic  gene  clusters  such  as  urdamycin  ,  hedamycin  ,  and  simocyclinone  .  The  putative 
ssf  glycosyltransferase,  SsfS6,  has  high  sequence  homology  to  HedJ,  a  C-glycosyltransferase  from  the 
hedamycin  biosynthetic  pathway25.  Since  SsfS6  is  the  first  glycosyltransferase  that  can  modify  the 
tetracycline  scaffold,  it  can  potentially  be  a  useful  enzyme  towards  increasing  the  structural  diversity  of 
the  D-ring  of  tetracycline  compounds. 

Genes  involved  in  the  biosynthesis  of  angelyl-CoA 

Little  has  been  reported  on  the  biosynthesis  of  angelic  acid  among  bacterial  natural  products, 
therefore  there  is  not  a  well  established  biosynthetic  pathway.  Angelate  is  commonly  found  to  be 
associated  with  plant  metabolites  and  is  biosynthesized  from  the  isoleucine  catabolic  pathway26, 21 .  No 
genes  were  found  in  the  ssf  cluster  that  indicate  a  plant-like  pathway  for  this  unusual  substitution  in 
SF2575.  Instead,  a  set  of  genes  encoding  enzymes  involved  in  the  formation  and  tailoring  of  a  short 
chain  acyl  group  was  found.  These  genes  include  SsfE  (carboxytransferase),  SsfJ  (enoyl-CoA 
hydratase/isomerase),  SsfN  (KSIII),  and  SsfK  (3-ketoacyl-ACP  reductase).  These  genes  resemble  those 
found  in  fatty  acid  biosynthesis  and  may  be  responsible  for  synthesizing  angelyl-CoA  from  a  propionyl- 
CoA,  or  other  short  chain  acyl  building  block.  One  possible  pathway  to  produce  angelate  from  genes 
found  in  the  ssf  pathway  is  shown  in  figure  6. 

SsfE  contains  a  carboxyltransferase  domain  (PF01039)  and  is  homologous  to  biotin-dependent 
methylmalonyl-CoA  decarboxylase  and  propionyl-CoA  carboxylase.  One  possible  role  for  SsfE  is  to 
synthesize  methylmalonyl-CoA  from  propionyl-CoA.  The  next  set  of  steps  in  the  biosynthesis  of  35  are 
proposed  to  catalyzed  by  ssfK,  ssfN  and  ssfJ,  which  are  cotranscribed  as  a  tricistronic  cassette  on  the 
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same  operon.  SsfN  is  homologous  to  KS  III  enzymes  such  as  FabH,  which  catalyzes  the  condensation 
between  acetyl-CoA  and  malonyl-ACP  to  initiate  fatty  acid  biosynthesis28.  FabH  homologs  have  been 
found  in  a  number  of  polyketide  gene  clusters  such  as  in  R1 128  ,  frenolicin  ,  and  daunorubicin  to 
catalyze  the  decarboxylative  condensation  of  short  acyl  groups.  SsfN  may  therefore  catalyze  the 
condensation  of  methylmalonyl-CoA  with  acetyl-CoA  to  form  2-methyl-acetoacetyl-CoA.  A  P- 
ketoreduction  catalyzed  by  SsfK,  which  is  homologous  to  3-oxoacyl-ACP  reductases  such  as  FabG31, 
yields  3-hydroxyl-2-methyl-butyryl-CoA.  Stereospecific  dehydration  by  SsfJ,  which  is  a  member  of  the 
enoyl-CoA  hydratase/isomerase  family  (PF00378),  results  in  angelate.  As  an  alternative  start  to  this 
pathway,  SsfN  may  directly  condense  malonyl-CoA  with  acetyl-CoA  to  form  acetoacetyl-CoA.  The  di¬ 
methyl  group  may  then  be  incorporated  by  SsfM3,  which  is  homologous  to  C-methyltransferases,  to 
afford  2-methyl-acetoacetyl-CoA. 
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Figure  6:  Proposed  biosynthetic  pathways  for  the  pendants  30, 32  and  35,  and  the  conversion  of  25  to  1.  The  order  of  these 
modifications  is  proposed  to  be  glycosylation  of  25  to  form  33,  sialycylation  of  the  C-4  hydroxyl  of  33  to  form  34,  and 
angelycylation  of  D-olivose  to  form  1.  The  proposed  intermediates  33  and  34  have  been  detected  in  the  extract  of  S.  sp.  SF2575 
culture  by  LCMS. 


Genes  involved  in  the  biosynthesis  of  salicylate 

The  C-4  salicylate  substitution  in  1  is  unusual  not  just  among  tetracyclines  but  quite  rare  among 
aromatic  polyketides.  One  known  example  is  that  of  thermorubin,  in  which  the  salicylate  moiety  is 
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thought  to  be  incorporated  as  a  starter  unit32.  Sequence  analysis  of  ssf  gene  cluster  revealed  a  likely 
mechanism  of  salicylate  synthesis  from  the  shikimate  pathway  (Figure  6),  similar  to  that  utilized  in  the 
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biosynthesis  of  siderophores  yersinobactin  and  mycobactin  ’  .  SsfH  was  found  to  be  homologous  to 
the  salicylate  synthase  genes  Irp933  and  Mtbl34  from  yersinobactin  and  mycobactin  biosynthesis 
respectively,  which  convert  chorismate,  a  byproduct  of  the  shikimate  pathway,  to  salicylate  in  a  two  step 
reaction  that  proceeds  through  an  isochorismate  intermediate.  In  addition,  the  ssf  gene  cluster  contains 
ssfl  that  encodes  a  3-deoxy-d-araZ>mo-heptulosonate-7-phosphate  (DAHP)  synthase  that  condenses 
erythrose-4-phosphate  and  phosphoenolpyruvate  into  DAHP36,  which  is  the  first  step  in  the  shikimate 
pathway.  Genes  encoding  the  remaining  of  the  shikimate  pathway  that  converts  DAHP  to  chorismate 
are  absent  from  the  ssf  gene  cluster  and  are  likely  shared  with  the  endogenous  metabolism  of  S.  sp. 
SF2575.  The  ssfl  gene  is  therefore  likely  an  extra  copy  dedicated  for  the  biosynthesis  of  1  and  serves  to 
direct  carbon  flow  through  the  shikimate  pathway  when  SF2575  biosynthesis  is  induced. 

Attachment  of  31  to  the  aglycon  is  proposed  to  occur  through  an  ATP-dependent  activation  of 
salicylate  to  salicylyl-CoA  32  by  a  salicylyl-CoA  ligase,  and  subsequent  transfer  catalyzed  by  an 
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acyltransferase.  SsfLl  has  high  sequence  homology  to  NcsB2  andMdpB2  ,  which  catalyze  formation 
of  the  CoA-ester  of  naphthanoic  acid  and  3,6-dimethylsalicylic  acid  in  the  enediyne  biosynthetic 
pathways  of  neocarzinostatin  and  maduropeptin,  respectively.  To  reconstitute  the  synthesis  of  salicyl- 
CoA,  putative  salicylyl-CoA  ligase  SsfLl  was  expressed  from  E.  coli  BL21(DE3)  as  an  A-terminal 
hexahistidine  tag  fusion  protein  and  purified  using  Ni-NTA  chromatography.  ATP  and  salicylic  acid 
were  added  to  SsfLl  and  the  release  of  pyrophosphate  (PP;)  was  monitored  spectrophotometrically  at 
340  nm  using  a  coupled  assay  that  oxidizes  NADH.  As  shown  in  Figure  7A,  PP;  release  was  observed 
only  in  the  presence  of  SsfLl,  ATP  and  salicylic  acid.  To  demonstrate  the  CoA  ligase  properties  of 
SsfLl,  enzymatic  synthesis  of  salicyl-CoA  was  performed  by  introducing  CoA  into  the  assay.  After 
overnight  incubation,  a  new  compound  consistent  with  the  retention  time  of  salicyl-CoA  was  purified 
from  the  reaction  mixture  by  HPLC  and  mass  analysis  confirmed  the  identity  ([MTH]+  at  m/z  =  888). 

Task  2.  Probe  the  mechanism  of  key  enzymes  involved  in  the  biosynthesis  of  SF2575. 


Since  the  D-olivose  biosynthetic  pathway  has  been  well  established,  we  sought  to  investigate  the 
more  unclear  aspects  of  SF2575  biosynthesis,  the  attachment  of  the  unique  acyl  groups.  There  are  two 
predicted  acyltransferase  enzymes  in  the  ssf  gene  cluster,  SsfX3  and  SsfV.  In  vitro  experiments  have 
verified  SsfX3,  which  has  weak  sequence  homology  to  the  GDSL  lipase  family39,  as  the  salicyl 
acyltransferase.  To  probe  the  potential  reaction  catalyzed  by  SsfX3,  we  first  assayed  the  reverse 
hydrolysis  reaction  using  different  products.  SF2575  was  treated  with  purified  SsfX3  overnight  and  the 
assay  mixture  was  analyzed  by  LCMS.  The  result  showed  the  presence  the  parent  compound  SF2575 
and  a  second  compound  with  mass  corresponding  to  the  loss  of  salicylate.  A  control  reaction  without 
SsfX3  did  not  afford  this  compound,  as  determined  by  selected  ion  monitoring.  The  UV  of  this 
compound  showed  Amax  of  358  nm  and  the  loss  of  the  shoulder  at  302  nm,  consistent  with  the  hydrolysis 
of  salicylic  acid.  This  result  was  the  first  indication  that  SsfX3  may  be  involved  in  C-4  salicylation.  To 
confirm  the  SsfX3-catalyzed  acyltransfer  reaction,  an  in  vitro  assay  was  performed  in  which  SsfX3  and 
SsfLl  were  added  to  33  with  salicylic  acid,  ATP  and  free  CoA.  The  reaction  was  incubated  for  30 
minutes  at  room  temperature  and  analyzed  by  HPLC  (Figure  7B).  The  reaction  containing  all 
components  showed  nearly  quantitative  conversion  of  33  to  34  (Figure  7B,  trace  iii).  Omission  of  any 
of  the  enzyme,  substrate,  or  cofactor  component  of  the  assay  did  not  lead  to  the  synthesis  of  34,  as 
shown  in  Figure  7B  traces  iv-viii.  Together,  these  results  confirmed  that  SsfX3  is  responsible  for  the 
conversion  of  33  to  34  using  32  as  the  acyl  donor.  Since  GSDL  family  enzymes  are  often  found  to  have 
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broad  substrate  specificity,  which  is  proposed  to  be  the  result  of  a  flexible  substrate  binding  pocket39, 
investigating  the  substrate  tolerance  of  SsfX3  may  be  a  useful  avenue  for  diversifying  the  C-4 
functionality  of  SF2575. 


Task  3.  Reconstitute  the  biosynthesis  of 

SF2575. 

Due  to  the  large  size  of  the 
SF2575  biosynthetic  gene  cluster,  we 
were  unable  to  find  a  cosmid  that 
contained  the  entire  gene  cluster.  Efforts 
are  ongoing  to  clone  each  of  two 
cosmids  that  span  the  entire  gene  cluster 
into  two  different  Streptomyces  shuttle 
vectors  that  can  be  transformed 
simultaneously  into  a  heterologous  host, 
however  this  has  not  yet  been  successful. 
In  place  of  heterologous  biosynthesis,  we 
have  shifted  efforts  toward  creating 
genetic  knockouts  to  investigate  the 
biosynthesis  and  create  analogs  of 
SF2575.  With  the  successful 
construction  of  a  ssfB  knockout,  we 
believe  this  strategy  may  be  applied  to 
many  additional  genes.  We  have  begun 
to  knockout  ssfH  in  order  to  create  a 
mutant  that  is  deficient  in  salicylate 
biosynthesis  which  may  be  used  to  create 
SF2575  analogs  by  mutasynthesis. 

Task  4.  Test  the  antitumor  activities  of 

SF2575. 

Structure-activity  relationship  bioactivity 
assays 

The  angelate  and  salicylate 
modifications  of  SF2575  are  unique 
structural  features  among  tetracyclines 
and  bacterial  aromatic  polyketides.  To 
probe  whether  these  moieties  are  the 
“warheads”  that  contribute  to  the 
antitumor  activities  of  SF2575,  we 
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Figure  7:  Reconstitution  of  the  synthesis  and  transfer  of  salicylate 
to  the  aglycon.  (A)  Synthesis  of  salicylyl-AMP  as  indicated  by  the 
release  of  PPi  in  the  presence  of  ATP,  salicylate  and  putative 
salicylyl-CoA  ligase  SsfLl.  Black  squares:  all  reaction  components; 
black  triangles:  no  salicylate;  open  circles:  no  ATP;  open  squares: 
no  SsfLl;  and  grey  diamonds:  pyrophosphate  reagent  only.  (B)  The 
tandem  actions  of  SsfLl  and  SsfX3  transfers  salicylate  to  the 
aglycon  substrate  33  to  yield  34.  The  assays  are  performed  in 
50mM  HEPES,  pH  7.9  and  10  mM  MgCl2.  i)  the  semisynthetic  33 
standard;  ii)  the  semisynthetic  34  standard;  iii)  Complete  reaction 
containing  50  mM  HEPES  pH  7.9,  10  mM  MgCl2,  2  mM  ATP,  2 
mM  free  CoA,  2  mM  31,  20  pM  33,  1.5  pM  SsfX3  and  15  pM 
SsfLl.  Control  reactions  were  performed  as  iii)  with  the  following 
exclusions:  iv)  no  SsfLl;  v)  no  ATP;  vi)  no  31;  vii)  no  SsfX3;  and 
viii)  no  CoA.  All  reactions  were  performed  at  25 °C  for  30  minutes, 


performed  in  vitro  cytotoxicity  assays 
with  Nalm-6  pre-B  cells  using  SF2575  with  both  acyl  groups  removed  33,  SF2575  with  salicylic  acid 
group  hydrolyzed  34  and.  While  the  parent  compound  SF2575  had  a  potent  IC50  of  8.8  nM,  removal  of 
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the  angelate  resulted  in  a  significantly  attenuated  IC50  of  327  nM  for  34.  Further  hydrolysis  of  the 
salicylate  as  in  33  led  to  an  additional  15 -fold  decrease  of  potency  with  an  IC50  of  5.2  pM.  Similar 
trends  were  observed  when  other  cell  lines,  including  MCF-7,  HeLa,  and  M249  were  subjected  to  the 
cytotoxicity  assays.  Hence,  both  pendant  groups  are  critically  important  for  the  bioactivity  of  SF2575 
and  are  attractive  targets  for  structural-activity-relationship  studies.  Tailoring  enzymes  identified  here 
SsfLl  and  SsfX3  responsible  for  the  salicylate  addition,  and  further  investigation  into  the  mechanism  of 
addition  of  angelate  may  therefore  provide  useful  biosynthetic  approaches  towards  functionalizing  these 
positions,  especially  considering  the  densely  functionalized  tetracycline  core  may  be  difficult  to  assess 
synthetically. 


LogfICso)  Std.  Error 


Cell  line 

Type 

1 

34 

33 

Nalm-6 

human  B  cell  precursor  leukemia 

-8.056  ±0.0518 

-6.485  ±  0.0753 

-5.285  ±  0.0958 

HeLa 

human  cervix  carcinoma 

-7.466  ±  0.0769 

-6.196  ±0.153 

>-5.0 

MCF-7 

human  breast  adenocarcinoma 

-7.619  ±  0.1 12 

-5.068  ±0.0813 

>-5.0 

M249 

human  melanoma 

-8.134  ±0.144 

-6.788  ±0.0921 

>-5.0 

Table  1:  Cytotoxicity  of  SF2575  and  derivatives 


Key  Research  Accomplishments 

•  Gene  cluster  responsible  for  the  biosynthesis  of  tetracycline  family  anticancer  compound  SF2575 
has  been  identified  and  sequenced  from  the  genome  of  native  host  Streptomyces  sp.  SF2575 

•  Bioinformatic  analysis  of  genes  found  in  the  ssf  gene  cluster  as  well  as  analysis  of  intermediates 
in  S.  sp.  SF2575  fermentation  broth  has  been  used  to  construct  a  putative  pathway  for  the 
biosynthesis  of  SF2575 

•  Gene  cluster  has  been  verified  through  heterologous  reconstitution  of  tetracycline  producing 
genes  through  the  biosynthesis  of  characteristic  intermediate  6-methylpretetramid 

•  Enzymes  responsible  for  the  unique  addition  of  salicylic  acid  to  the  A  ring  of  SF2575  have  been 
positively  identified  and  characterized  through  in  vitro  analysis. 

•  SAR  studies  have  been  conducted  by  testing  the  bioactivity  of  authentic  SF2575  and  variants 
produced  by  removal  of  the  angelate  ligand  and  both  the  angelate  and  salicylate  ligands.  MTS 
assay  using  four  different  cell  lines  showed  a  marked  decrease  in  bioactivity  following  the 
removal  of  the  angelate  and  even  further  decrease  by  the  removal  of  both  acyl  groups 

Reportable  Outcomes 

•  Manuscript  in  preparation 

Lauren  B.  Pickens,  Peng  Wang,  Hui  Zhou,  Woncheol  Kim,  Kenji  Watanabe,  Shuichi  Gomi,  Yi  Tang 
“Biochemical  Analysis  of  the  Biosynthetic  Pathway  of  an  Anticancer  Tetracycline  SF2575”. 

•  Presentations: 

Pickens,  L.  B.,  Tang,  Y.,  “  Investigating  the  biosynthesis  of  a  tetracycline  analog  with  antitumor 
properties”.  International  Conference  on  Biological  Engineering,  Jan  18-21,  2009,  Santa  Barbara,  CA 
(poster) 
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Pickens,  L.  B.,  Tang,  Y.,  “  Investigating  the  biosynthesis  of  a  tetracycline  analog  with  antitumor 
properties”.  AICHE  annual  meeting,  Nov  17-21,  2008,  Philadelphia,  PA  (presentation) 

•  GenBank  submission 

SF2575  gene  cluster  deposited  to  GenBank.  Accession  number:  GQ409537 

Conclusions 

We  have  identified  and  sequenced  the  gene  cluster  responsible  for  the  biosynthesis  of 
tetracycline  family  anticancer  compound  SF2575  from  native  host  S.  sp.  SF2575.  Using  heterologous 
biosynthesis  we  have  verified  the  tetracycline  nature  of  the  gene  cluster  by  reconstituting  the 
biosynthesis  of  key  early  intermediates.  We  have  identified  and  characterized  key  tailoring  enzymes 
responsible  for  the  salicylic  acid  modification.  These  enzymes  are  unique  to  tetracycline  biosynthetic 
pathways.  Initial  substrate  specificity  experiments  with  these  enzymes  indicate  that  they  may  potentially 
be  exploited  to  generate  improved  analogs  of  this  already  potent  anticancer  compound,  as  well  as  the 
antibiotic  tetracyclines. 
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Supporting  Data 


Name 

Closest  homolog 

Putative  Role 

Size 

Identity/ 

Similarity 

Accession 

number 

SsfA 

OxyA 

Keto  synthase 

45  kDa 

76%  /  86% 

CAA80985 

SsfB 

OxyB 

Chain  length  factor 

43  kDa 

64%  /  77% 

AAZ78326 

SsfC 

OxyC 

Acyl  carrier  protein 

9.5  kDa 

54%  /  73% 

AAZ78327 

SsfD 

OxyD 

Amidotransferase 

68  kDa 

68%  /  79% 

AAZ78328 

SsfE 

ChlJ 

Carboxytransferase 

58  kDa 

81%/ 89% 

AAZ77684 

SsfF 

Npun  F4742 

Ketoreductase 

34  kDa 

55%  /  70% 

ACC83094 

SsfG 

ChlB3 

Ketosynthase  III 

36  kDa 

37%  /  56% 

DQ1 16941 

SsfH 

Mbtl 

Salicylate  synthase 

48  kDa 

42%  /  57% 

Q7D785 

Ssfl 

Plml 

DAHP-7-phosphate  synthase 
Enoyl  CoA 

45  kDa 

56%  /  67% 

AY354515 

SsfJ 

SAV  2786 

hy  dratase/ isomerase 

28  kDa 

61%/ 75% 

BAC70497 

SsfK 

Athe  1156 

3-Ketoacyl-(ACP)  reductase 

26  kDa 

38%/ 59% 

ACM60257 

SsfLl 

SdgA 

Salicyl-AMP  ligase 

58kDa 

75%  /  84% 

BAC78380 

SsfL2 

OxyH 

Acyl-CoA  ligase 

56  kDa 

51%/ 67% 

DQ143963 

SsfMl 

OxyT 

O-methyl  transferase 

36  kDa 

48%  /  62% 

DQ143963 

SsfM2 

DauK 

O-methyl  transferase 

35  kDa 

38%/ 53% 

AAB16938 

SsfM3 

MtmMII 

Methyl  transferase 

37  kDa 

38%/ 55% 

CAK50790 

SsfM4 

OxyF 

C-6  methyl  transferase 

37  kDa 

51%/ 68% 

AAZ78330 

SsfN 

PpzT 

Ketosynthase  III 

31  kDa 

46%  /  63% 

CAX48662 

SsfOl 

OxyS 

Oxygenase 

55  kDa 

42%  /  54% 

AAZ78342 

Ssf02 

OxyL 

Oxygenase 

59  kDa 

57%  /  70% 

AAZ78335 

SsfP 

CminQ 

Dehydrogenase 

29  kDa 

49%/54% 

YP  118212 

SsfQ 

Lct52 

SAM  synthetase 

43  kDa 

84%  /  90% 

ABX71135 

SsfR 

Krad  0275 

MFS  transporter 

58  kDa 

33%/ 51% 

ABS01765 

SsfSl 

StrD 

dTDP  glucose  synthase 

39  kDa 

72%  /  83% 

BAG22759 

SsfS2 

CalS3 

Glucose-4, 6-dehydratase 

36  kDa 

69%  /  80% 

AAM94770 

SsfS3 

Sim20 

Hexose-2, 3 -dehydratase 

53  kDa 

58%  /  72% 

AF322256 

SsfS4 

ChlC4 

Hexose-3  -ketoreductase 

33  kDa 

49%  /  59% 

AAZ77681 

SsfS5 

NanG4 

Hexose-4-ketoreductase 

37  kDa 

42%  /  53% 

AAP42863 

SsfS6 

HedJ 

Glycosyltransferase 

41  kDa 

40%  /  55% 

AAP85354 

SsfTl 

SnorA 

Regulation 

31  kDa 

44%/ 61% 

CAA12016 

SsfT2 

SAML0351 

Regulation  (TetR  family) 

23  kDa 

48%  /  62% 

CAJ89338 

SsfU 

OxyJ 

C-9*  ketoreductase 

27  kDa 

76%  /  86% 

AAZ78333 

SsfV 

ZhuC 

Acyl  transferase 

34  kDa 

54%  /  64% 

AAG30190 

SsfXl 

Chll 

Unknown  function 

8.5  kDa 

50%  /  60% 

AAZ77683 

SsfX2 

RemJ 

Carbohydrate  kinase 

36  kDa 

64%  /  72% 

CAE51179 

SsfX3 

AviX9 

Acyltransferase 

40  kDa 

48%/ 61% 

AAK83171 

SsfYl 

OxyK 

Aromatase/ Cyclase 

34  kDa 

57%  /  67% 

AAZ78334 

SsfY2 

NcnE 

Second  ring  cyclase 

33  kDa 

56%  /  68% 

AAD20271 

SsfY3 

CminQ 

Cyclase/ aromatase 

31  kDa 

29%  /  42% 

CAE17552 

SsfY4 

Oxyl 

Hypothetical  protein 

Cyclase/ aromatase 

13  kDa 

66%  /  76% 

AAZ78332 

Orf-1 

SGR  5601 

Unknown 

12  kDa 

84%/94% 

BAG22430 

Orf-2 

SSEG  08767 

FeS  assembly  protein 

17  kDa 

89%/96% 

EDY55253 

Orf-3 

SSEG_03816 
Hypothetical  conserved 

Cysteine  desulferase 

46  kDa 

90%/94% 

EDY65440 

Orfl 

protein 

Hypothetical  protein 

Unknown 

66  kDa 

60%/69% 

EEP13034 

Orf2 

SCO3803 

Unknown 

AraC  Transcriptional 

14  kDa 

78%/88% 

NP  627922 

Orf3 

SSEG  08979 

Regulator 

35  kDa 

82%/90% 

EDY56197 

Table  A1  Putatively  assigned  functions  of  enzymes  encoded  in  the  ^/biosynthetic  gene  cluster 
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